Document Identiication for Copyright Protection Using Centroid Detection

نویسندگان

  • N. F. Maxemchuk
  • A. M. Lapone
چکیده

A way to discourage illicit reproduction of copyrighted or sensitive documents is to watermark each copy before distribution. A unique mark is embedded in the text whose recipient is registered. The mark can be extracted from a possibly noisy illicit copy, identifying the registered recipient. Most image marking techniques are vulnerable to binarization attack and hence not suitable for text marking. We propose a diierent approach where a text document is marked by shifting certain text lines slightly up or down or words slightly left or right from their original positions. The shifting pattern constitutes the mark and is diierent on diierent copies. In this paper we develop and evaluate a method to detect such minute shifts. We describe a marking and identiication prototype that implements the proposed method. We present preliminary experimental results which connrms the analytical prediction that centroid detection performs remarkably well on line shifts even in the presence of severe distortions introduced by printing, photocopying, scanning, and facsimile transmission.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Comparison of Two Text

A text document typically consists of a collection of regular structures such as words, lines and paragraphs, a slight movement of which seems less perceptible than, say, dithering of the document image. In this paper we exploit this property to watermark formatted text documents by shifting slightly certain lines and words, in order to discourage illicit distribution. We analyze two methods fo...

متن کامل

Document identification for copyright protection using centroid detection

A way to discourage illicit reproduction of copyrighted or sensitive documents is to watermark each copy before distribution. A unique mark is embedded in the text whose recipient is registered. The mark can be extracted from a possibly noisy illicit copy, identifying the registered recipient. Most image marking techniques are vulnerable to binarization attack and, hence, not suitable for text ...

متن کامل

Robust Watermarking of Still Images for Copyright Protection

Digital watermarking has been proposed as a mean to protect the copyright of multimedia data in a networked environment, since it makes possible to tightly embed a code into a digital document allowing the identiication of the data owner. In this paper a new watermarking system for digital images is presented: the method embeds a sequence of random real numbers in a selected set of DCT coeecien...

متن کامل

Centroid-based summarization of multiple documents

We present a multi-document summarizer, MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. We describe two new techniques, a centroid-based summarizer, and an evaluation scheme based on sentence utility and subsumption. We have applied this evaluation to both single and multiple document summaries. Finally, we describe two user studies tha...

متن کامل

حمایت از حق مؤلف در فضای سایبر در حقوق ملی و اسناد بین‌المللی

  Development of information technology and entrance to digital millennium confronted Copyright system with some serious challenges so that in some cases, protection of creators of digital works and protection of artistic and literary works in digital and cyber space and performance of this works in that space is in doubt. In order to removing this concerns and protection of copyright a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007